Efficient Segmental Conditional Random Fields for Phone Recognition

نویسندگان

Yanzhang He

Eric Fosler-Lussier

چکیده

Recently the initial attempt has been made to use segment-based direct models on their own for phone classification and recognition without the aid of an HMM lattice. This paper follows this line of research to further investigate these one-pass segmental direct models on phone recognition using posteriors as input. We make the first direct comparison between a frame-based system and a segmental system using the same base features, and explore the utilization of transition features in a direct segmental model for the first time. The results show that transition features can be very beneficial, particularly the ones surrounding the segment boundaries. In order to efficiently incorporate such features, we propose the Boundary-Factored SCRF, which reduces the time complexity of a Segmental Conditional Random Field (SCRF) to that of a frame-level CRF.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Segmental conditional random fields with deep neural networks as acoustic models for first-pass word recognition

Discriminative segmental models, such as segmental conditional random fields (SCRFs), have been successfully applied to speech recognition recently in lattice rescoring to integrate detectors across different levels of units, such as phones and words. However, the lattice generation has been constrained by a baseline decoder, typically a frame-based hybrid HMMDNN system, which still suffers fro...

متن کامل

SCARF: a segmental conditional random field toolkit for speech recognition

This paper describes a new toolkit SCARF for doing speech recognition with segmental conditional random fields. It is designed to allow for the integration of numerous, possibly redundant segment level acoustic features, along with a complete language model, in a coherent speech recognition framework. SCARF performs a segmental analysis, where each segment corresponds to a word, thus allowing f...

متن کامل

SCARF: A Segmental CRF Speech Recognition System

We propose a theoretical framework for doing speech recognition with segmental conditional random fields, and describe the impleme-nation of a toolkit for experimenting with these models. This framework allows users to easily incorporate multiple detector streams into a discriminatively trained direct model for large vocabulary continuous speech recognition. The detector streams can operate at ...

متن کامل

CRANDEM: conditional random fields for word recognition

To date, the use of Conditional Random Fields (CRFs) in automatic speech recognition has been limited to the tasks of phone classification and phone recognition. In this paper, we present a framework for using CRF models in a word recognition task that extends the well-known Tandem HMM framework to CRFs. We show results that compare favorably to a set of standard baselines, and discuss some of ...

متن کامل

A comparison of training approaches for discriminative segmental models

Segmental models such as segmental conditional random fields have had some recent success in lattice rescoring for speech recognition. They provide a flexible framework for incorporating a wide range of features across different levels of units, such as phones and words. However, such models have mainly been trained by maximizing conditional likelihood, which may not be the best proxy for the t...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2012

Efficient Segmental Conditional Random Fields for Phone Recognition

نویسندگان

چکیده

منابع مشابه

Segmental conditional random fields with deep neural networks as acoustic models for first-pass word recognition

SCARF: a segmental conditional random field toolkit for speech recognition

SCARF: A Segmental CRF Speech Recognition System

CRANDEM: conditional random fields for word recognition

A comparison of training approaches for discriminative segmental models

عنوان ژورنال:

اشتراک گذاری